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Abstract. A contraction analysis of risk-sensitive Riccati equations is proposed. When the 
state-space model is reachable and observable, a block-update implementation of the risk-sensitive 
filter is used to show that the JV-fold composition of the Riccati map is strictly contractive with 
respect to the Riemannian metric of positive definite matrices, when N is larger than the number of 
states. The range of values of the risk-sensitivity parameter for which the map remains contractive 
can be estimated a priori. It is also found that a second condition must be imposed on the risk- 
sensitivity parameter and on the initial error variance to ensure that the solution of the risk-sensitive 
Riccati equation remains positive definite at all times. The two conditions obtained can be viewed 
as extending to the multivariable case an earlier analysis of Whittle for the scalar case. 
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1. Introduction. Starting with Kalman and Bucy's paper [TJ, the convergence 
of the Kalman filter has been examined in detail, and it soon became clear that if 
the state-space model is stabilizable and detectable, the filter is asymptotically stable 
and the error covariance converges to the unique non-negative definite solution of a 
matching algebraic Riccati equation. However the classical Kalman filter convergence 
analysis is rather intricate and involves several steps, including first showing 

that the error covariance is upper bounded, next proving that with a zero initial 
value, it is monotone increasing, so it has a limit, and then establishing that the 
corresponding filter is stable and that the limit is the same for all initial covariances. 
In 1993, Bougerol [5] proposed a more direct convergence proof based on establishing 
that the discrete-time Riccati iteration is a contraction for the Riemmanian metric 
associated to the cone of positive definite matrices. Although this result attracted 
initially little notice in the systems and control community, this approach was adopted 
by several researchers [l6l - [T8l[20] to study the convergence of a number of nonlinear 
matrix iterations. We will use this viewpoint here to analyze the convergence of 
risk-sensitive estimation filters. Unlike the Kalman filter which minimizes the mean 
square estimation error, risk-sensitive filters |2ip23j minimize the expected value of an 
exponential of quadratic error index, which ensures a higher degree of robustness [TlllO) 
against modelling errors. Unfortunately, in spite of extensive studies on risk-sensitive 
and related H°° filters, results concerning their convergence remain fragmentary |23| 
Chap. 9], |10[ Sec. 14.6], [3 . In particular, one question that remains unresolved 
is whether there exists an a-priori upper bound on the risk-sensitivity parameter 
ensuring the convergence of the solution of the risk-sensitivite Riccati equation to a 
positive definite solution associated to a stable filter. 

The contraction anaysis presented in this paper relies on a block implementation 
of Kalman (risk-neutral) and risk-sensitive filters. When the system is reachable 
and observable and the block length N exceeds the number of states, it is shown 
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that in the risk-neutral case, the Riccati equation corresponding to the block filter is 
strictly contractive, which allows us to conclude that the Riccati equation of Kalman 
filtering has a unique positive definite fixed point. This analysis is equivalent to 
the derivation of [5] which relied on showing that the A^-fold composition of the 
Hamiltonian map associated to the risk-neutral Riccati operator is strictly contractive. 
However, it has the advantage that it can be extended easily to the risk-sensitive case 
by using the Krein-space formulation of risk-sensitive and H°° filtering developed 
in [5112]. With this approach, it is is shown that the TV-block risk-sensitive Riccati 
equation remains strictly contractive as long as a corresponding observability Gramian 
is positive definite. This Gramian is shown to be a monotone decreasing function of the 
risk-sensitivity parameter 9 with respect to the partial order of non-negative definite 
matrices. Accordingly, it is possible to identify a priori a range [0,Tff) of values 
of the risk-sensitivity parameter 9 for which the block Riccati equation is strictly 
contractive. This result is used to show that the risk-sensitive Riccati equation has a 
unique positive definite fixed-point, but because the image of the cone V of positive 
definite matrices under the risk-sensitive Riccati map is not entirely contained in V 
a second condition must be placed on 9 and the initial variance Pq of the filter to 
ensure that the evolution of the risk-sensitive Riccati equation stays in V. The two 
conditions obtained can be viewed as extensions to the multivariable case of those 
presented in [23l Chap. 9] for scalar risk-sensitive Riccati equations. 

The paper is organized as follows. The properties of the Riemann distance for 
positive definite matrices and of contraction mappings are reviewed in Section [2] The 
block- update filtering interpretation of the A^-fold Riccati equation of Kalman filtering 
is described in Section |3] and is extended to the risk-sensitive case in Section 01 This 
formulation is used to estimate the range of values of the risk-sensitivity parameter 
for which the risk-sensitive Riccati equation is contractive. A second condition on 
the risk-sensitivity parameter and initial condition ensuring that the solution of the 
Riccati equation remains positive is obtained in Section |5l An illustrative example is 
studied in Section |6l and conclusions as well as a possible extension are presented in 
Section [71 

2. Riemann distance and contraction mappings. Let V denote the cone 
of positive definite symmetric matrices of dimension n, and let P denote the cone of 
non- negative definite matrices forming the closure of P. If P is an element of P with 
eigendecomposition 

P = UAU^ (2.1) 

where U is an orthogonal matrix formed by normalized eigenvectors of P and A = 
diag {Ai, . . . , A„} is the diagonal eigenvalue matrix of P, the symmetric positive 
square-root of P is defined as 

pl/2 ^ [/Al/2yT 

where A^/^ is diagonal, with entries for 1 < i < n. Similarly, the logarithm of P 
is the symmetric, not necessarily positive definite, matrix specified by 

log(P) = C/log(A)C/^, 

where log(A) is diagonal with entries log(Ai) for 1 < i < n. Let P and Q be two 
positive definite matrices of V. Then P^^Q is similar to P^^^^QP^^^^ , so they have 
the same eigenvalues, and P~^/'^QP~^^^ is positive definite. Let si > S2 > . . . > s„ > 
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denote the eigenvalues of P sorted in decreasing order. The Riemann distance 
between P and Q is defined as 

" 1/2 

d(P,Q) = l|log(F-i/2gp-i/2)|| = (^(log(s,)f) , (2.2) 

i=l 

where 1 1 . 1 1 denotes the matrix Frobenius norm. In addition to having all the traditional 
properties of a distance, d has the feature that it is invariant under matrix inversion 
and congruence transformations. Specifically, if M denotes an arbitrary real invertible 
matrix of dimension n, 

d(P, Q) = d{p-^,Q-^) ^ diMPM^, MQM^) . (2.3) 

Furthermore, it was also shown by Bougerol [S] that the translation of P by a non- 
negative definite symmetric matrix is a non-expansive map. Specifically, 

d{P + S,Q + S)<^—d{P,Q) (2.4) 

a + p 

where a = max(Ai(P), \i{Q)) and (3 = A„(S'). In these definitions, it is assumed that 
the eigenvalues of P, Q and S are sorted in decreasing order, so that Ai(P) is the 
largest eigenvalue of P, i.e., its spectral norm, and A„(5) is the smallest eigenvalue of 
S. 

Recall that if /(•) is an arbitrary mapping of P, / is non-expansive if 

d{f{P),f{Q))<d{P,Q), 

and strictly contractive if 

d{f{P),f{Q))<cd{P,Q) 

with < c < 1. The least contraction coefficient or Lipschitz constant of a non- 
expansive mapping / is defined as 

Clearly, if / and g denote two non-expansive mappings, the contraction coefficient 
c(/ ° g) of the composition of / and g satisfies c(/ o g) < c{f)c{g), so if at least one 
of the two maps is strictly contractive, the composition is also strictly contractive. 
From inequality (|2.4p . we deduce that if ts(P) = P + S denotes the translation by a 
positive definite matrix S, ts is non-expansive, but the bound (|2.4[) does not allow us 
to conclude that c(rs) < 1, since when the largest eigenvalue of either P or Q goes to 
infinity, the constant a/ {a + (3) tends to one. 

The key result that will be used in this paper is that if / is a strict contraction 
of V for the distance d, by the Banach fixed point theorem [21 p. 244], there exists 
a unique fixed point P of f in P satisfying P — f{P). Furthermore this fixed point 
can be evaluated by performing the iteration Pn+i = f{Pn) starting from any initial 
point Po of V. Also if the A''-fold composition of a non-expansive map / is 
strictly contractive, then / has a unique fixed point. We will consider in particular 
the Riccati-type map over V defined by 

f{P) ^ M[P-'^ + n]-^M^ + W (2.6) 
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where P, fl and W are symmetric real positive definite matrices and M is a square 
real, but not necessarily invertible, matrix. For this mapping the following result was 
established in UHl Th. 4.41. 



Lemma 2.1. f is a strict contraction with 

Ai(Afrj-iAf^) 



<f)< 



A„(VK) + Ai(Am-iMT) 



(2.7) 



where we use again the convention that the eigenvalues of positive definite matrices 
are sorted in decreasing order. 

Note that although the results presented in this paper use the Riemann distance 
over V, other metrics such as the Thompson metric 

dT{P,Q) = max(Ai(log(p-i/2Qp-i/2)),Ai(log(Q-i/2pQ-i/2))) . (2.8) 

have similar properties [HUT], and the Thompson metric is in fact often used to 
analyze the convergence of nonlinear matrix iterations |17[|20j . 



(3.1) 
(3.2) 



3. Block update filter. Consider a Gauss-Markov state space model 



xt+i = Axt -I 
yt = Cxt 



But 



where the state Xt G M", the process noise Ut £ and the observation noise vt G M^. 
The noises Ut and Vt are assumed to be independent zero-mean WGN processes with 
normalized covariance matrices, so 



E 



Ut 
Vt 



[ 



where 



6t = 





■ I,n " 




Ip _ 


t = 





t ^ 



denotes the Kronecker delta function. The initial state vector xq is assumed inde- 
pendent of noises Ut and Vt and J\f{xo,Po) distributed. Since we are interested in the 
asymptotic behavior of Kalman and risk-sensitive filters, the matrices A, B, and C 
specifying the state-space model are assumed to be constant. Then if yt-i denotes 
the sigma field generated by observations y{s) for < s < t — 1, the least-squares 
estimate Xt = E[xt\yt-i] depends linearly on the observations and can be evaluated 
recursively by the predicted form of the Kalman filter specified by 



xt+i = Axt + KtVt 



where the innovations process 



A 

Vt = yt 



Cxt . 



(3.3) 



(3.4) 



In p.3p . the Kalman gain matrix 



Kt=APtC^{R'i)-\ 



(3.5) 
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where 

Ri = E[vtvJ] = CPtC^ + Ip (3.6) 

represents the variance of the innovations process, and if xt = xt — it denotes the 
state prediction error, its variance matrix Pt — E[xtxf] obeys the Riccati equation 

Pt+i = r(Pt) = AiPf' + C^C]-^A^ + BB^ (3.7) 

with initial condition Pq. This equation can also be rewritten in the equivalent form 
[IS P- 325] 

Pt+i = {A- KtC)Pt{A - KtCf + BB^ + KtKj , (3.8) 

which will be used later in our analysis. 

The Riccati mapping r{P) specified by p.7p has the form (|2.6p . Unfortunately the 
matrices C^C and BB^ are not necessarily invertible, so Lemma [2.11 is not directly 
applicable. Under the assumption that the pairs [A, B) and (C, A) are reachable 
and observable, respectively, Bougerol [5] was able to show that the rt-fold map r" is 
a strict contraction. This was achieved by considering the rt-fold composition of the 
symplectic Hamiltonian mapping associated to r (see [17| for a study of the contraction 
properties of symplectic Hamiltonian mappings). We present below an equivalent 
derivation of of Bougerol's result which relies on a block update implementation of 
the Kalman filter. 

The starting point is the observation that since Xt is Gauss-Markov, the down- 
sampled process xf = XkN with N integer is also Gauss-Markov with state-space 
model 

a;^+l=^M+7^^^uf (3.9) 
yf = 0^x^+vf +H^uf (3.10) 



where 



"fc — L '^kN+N-l ''^kN+N-2 ■ ' ■ ^kN J 

N FT T T 1 ^ 

y*; — L VkN+N-l ykN+N-2 ■ ■ ■ VkN \ 

N FT T T 1 ^ 

^k — y ""kN+N-l ^kN+N-2 ■ ■ ■ ""kN J 

In the model (p: ^ - ([XTU|) 

7^^^ = [ B AB ... A^-^B ] 

On = [ (CA^-^)^ ... {CAf f 

denote respectively the TV-block reachability and observability matrices of system 
()3.ip - ()3.21 where the blocks forming On are written from bottom to top instead of 
the usual top to bottom convention. If the pairs {A, B) and (C, A) are reachable and 
observable, TLn and On have full rank for N > n. In (I3.10p . if 



CA*-^B t > 1 
otherwise 
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denotes the impulse response representing the response of output yt in (13. 2p to the 
process noise Ut input in (13.11) , Hpf is the Np x Nm block Toephtz matrix defined by 



n 



N 



Hi H2 



Hi 







H2 

Hi 



N-2 



Hn-1 

Hn-2 
Hn-3 

Hi 




Note however that the noise vectors and = + Hnu^ are correlated since 



E 



NT ^,,NT 



Intu 



Inp + T-LnH 



4- 



This correlation can be removed by noting that the estimate of given takes 
the form 



where 



T \-l 



Then by premultiplying the observation equation p.lOp by TZnQn and subtracting it 
from p. 91) we obtain the new downsampled state dynamics 



N 



(3.11) 



with 



A 



aN — A - TZnQnOn , 

where the zero mean white Gaussian noise = — is now uncorrelated with 
observation noise w^, and has the invertible variance matrix 



The Kalman filter corresponding to the downsampled state-space model (|3.10p - 
p.lip can be interpreted as a block update filter, where the state estimate is updated 
only after a block of N observations has been collected. The Riccati equation corre- 
sponding to this Kalman filter is then given by 



where the n x n symmetric real matrices 



Wn 



(3.12) 



N 



(3.13) 



"AT 



(3.14) 
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are positive definite for > n whenever the pairs (C, A) and {A, B) are observable 
and reachable, respectively. In fact, flj^ and Wn can be viewed as observability and 
reachability Wronskians for the state-space model (I3.1l) - p.2l) . 

Prom Lemma [2.1[ we can therefore conclude that rd{-) is a strict contraction. 
However, since is the variance matrix of the one-step ahead prediction error for 
state XkN, fd coincides with the iV-fold composition of Riccati map r, which must 
have therefore a unique fixed point P in 'P. This establishes the following classical 
Kalman filter convergence result [I1II3] ■ 

Theorem 3.1. If in system h3.1\) - ^3^) the pairs {A,B) and {C,A) are reachable 
and observable, respectively, the algebraic Riccati equation P = r{P) admits a unique 
positive definite solution, and as t tends to infinity, for any positive definite initial 
condition Pq, Pf tends to P as t tends to infinity, and the Kalman gain matrix Kt 
tends to 

K ^ APC^[CPC^ + I]-^ 

which has the property that the matrix A — KC is stable. 

Given the fixed point P > 0, the stability of A — KC is obtained by applying the 
Lyapunov stability theorem to equation 

P^{A-KC)P{A-KCf + BB^ + KK'^ (3.15) 

(see [2 p. 80]). 

One unsatisfactory aspect of the contraction approach to the derivation of The- 
orem [XT] is its requirement that the system should be reachable and observable, in- 
stead of the weaker stabilizability and detectability conditions required by conven- 
tional Kalman filter convergence proofs pill3) . The stronger conditions conditions are 
needed to ensure that the Riccati evolution takes place entirely in the cone of posi- 
tive definite matrices. On the other hand, if the system is reachable and observable, 
the limit P is guaranteed to be positive definite, instead of just nonnegative definite 
under the usual assumptions. Finally, note that the block update implementation of 
the Kalman filter which was used here to show that rd = is a strict contraction 
is equivalent to Bougerol's derivation in [S], but as shown below it can be extended 
more easily to the risk-sensitive case. 

4. Contraction property of the risk-sensitive Riccati equation. For the 

state-space model (I3.ip - (l3.2p . the risk-sensitive estimate xt solves the exponential 
quadratic minimization problem |22[|23| 

i* = arg mmilog(p[exp(^||i?(a:t-aiP)|3^t-i]) , (4.1) 

where D G R"*^" with q < n \s assumed to have full row rank, and ||z|| = {z^z)'^/'^ 
denotes the Euclidean vector norm. The parameter 6 appearing in (|4.1|) is called the 
risk-sensitivity parameter. The resulting estimate xt obeys the recursion (|3.3|) - p.4p . 
where 

Kt = A{Ptf^ - eD^D)-^C'^{R'i)~^ (4.2) 

with 



(4.3) 
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and where Pt obeys the risk-sensitive Riccati equation 

P,+i = /(Ft) = A[P,-^ + C^C - 9D'^D]-^A^ 



BB' 



(4.4) 



Our analysis will use the fact that the risk-sensitive Riccati equation can be rewritten 



as 



Pt+i = {A- KtCm-' - eD^D]'\A - KtCf + BB^ + KtKf 



(4.5) 



The values 6 — 0^ 9 < Q and 6* > of the risk-sensitivity parameter correspond 
respectively to the risk-neutral, risk-seeking, and risk-averse cases. When 6* = 0, the 
risk-sensitive filter reduces to the Kalman fillter studied in the previous section, and 
when < Q the matrix C'^C — 9D^D is non-negative definite and can be rewritten 
as C^C where the pair formed by 



(- 



c 



and A is necessarily observable if (C, A) is observable. Accordingly, the convergence 
result of Theorem l3.1l is applicable to this problem, and in the remainder of this paper 
our attention will focus on the risk-averse case with 9 > 0. 

An interesting feature of the risk-sensitive filter is that it can be interpreted as 
solving a standard least-squares filtering problem in Krein space [Sl[5]. We will use 
this viewpoint here to extend the block filtering idea of the previous section to the 
risk-sensitive case. The Krein-space state-space model consists of dynamics (|3.1|) and 
observations p.2|) . to which we must adjoin the risk-sensitive observations 



(4.6) 



The components of noise vectors ut, vt and now belong to a Krein space and have 
the inner product 



(4.7) 



Ut 




Us 




Im 








Vs 




Ip 












-9-^Iq 





The iV-step observability matrix of the pair {D,A) is denoted as 



0§ = [ {DA^-')^ 



and if 



Lt 



DA^-^B 




t > 1 
otherwise 



denotes the impulse response from input ut to the risk-sensitive observation output, 
the corresponding A^-block Toeplitz matrix takes the form 



C 



N 











L2 








L 



N~2 



Ln-1 

Ln-2 
Ln-3 
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Then if 



the iV-block risk-sensitive observation for the downsampled process can be ex- 
pressed as 



= O^xi 



N 



(4.8) 



The Krein space inner product of observation noise vector 



11^ 



with itself admits the block LDU decomposition 



A 



JC' 



N 



Inp 


+ 




-0-^lNg 







[n 



N 



Ijvp 



Nq 



Inp + Ti-N'Hpf 




LNp 





N) 

Inq 



N 



(4.9) 



where 



A 



-9 ^iNq + CN{lNm + 'Hff'HN) ^ C 



IrT 

N 



(4.10) 



denotes the Schur complement of the (1,1) block inside /C^. The projection of 
noise vector on the Krein subspace spanned by the observation noise vector 

[ i'^k)'^ i^k^)'^ is then given by 



•N 



qO qRB 



AT 



,NR 



where 



and the residual 



— has for inner product 



N) 



A 



Nm 



N 



->R9 



N 



9 \T 
N) 
RB\T 



-1 



(4.11) 



The matrix will be positive definite if and only if 



(4.12) 



Note that this condition is also necessary and sufficient to ensure that the Schur 
complement Sf^ in (|4.10p is negative definite. Then by multiplying the observation 
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equation obtained by combining equations p.lOp and (|4.8p by T^jy [ Gn ^™ 



and 



subtracting it from (13.91) . we obtain the state-space equation 



(4.13) 



with 



A 



where the driving noise is now orthogonal to the noises w^^ and w^."' appearing m 
observation equations p.lOp and (|4.8p . Accordingly, the Riccati equation associated 
to the downsamplcd model takes the form 



RN 



pd 
^k+1 



(4.14) 



where 



On 



(4.15) 



with 



A 



£Nnj,[I + HNHl]-^ON 



and 



(4.16) 



For e 



0, the matrices flj^ and coincide with the risk- neutral Gramians fl^ 



and Wn defined in p.l3p and (I3.14p . These matrices are positive definite for N > n 
if and only if the pairs (C, A) and {A, B) are observable and reachable, respectively. 
Since is positive definite for < 6* < ^jv, we deduce that > over this range 
as long as {A, B) is reachable and N > n. On the other hand, the Schur complement 
matrix Sff is negative definite for < < ^at, so 



over this range. To establish that there exists a range < 9 < Tjy over which fi^ 
remains positive definite when (C, A) is observable, we use the following observation. 

Lemma 4.1. Over < 9 < On , the Gramians and are monotone de- 
creasing, and monotone nondecreasing, respectively, with respect to the partial order 
defined on nonnegative definite matrices. 



Proof. We have 



de 



and 



d_ 
d9^ 



!N 



d 

'd9' 



'N~T7i{Q%) ^Qn — Qn^Ji^nQn — 
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n 

To understand why and vary in opposite direction as 9 increases, note 
that can be viewed as a measure of the uncertainty introduced by the process 
noise in the state-space model, whereas 51^ is a measure of the information about the 
state contained in a block observation. As the risk-sensitivity parameter 9 increases, 
it is natural that the uncertainty matrix should increase and the information 
matrix 51^ should decrease. 

Let Tjv < 9]\f be the first value of 9 for which becomes singular. Then since 
il% and are positive definite for 9 G [0, tjv), we conclude that over this range 
the Riccati map is strictly contractive and has a unique fixed point P in 'P. Like 
the risk-neutral case, we have = (r^)^ . However, because the image r^{V) is not 
completely contained in P, to ensure that P is also the unique fixed point of r^, we 
must also require that r^{P) G V. Note indeed that if 

P^{rT{P). (4.17) 

by applying to both sides of (|4.17p . we obtain 

r%P) = {r^f{r^{P)) 

so r^{P) is a fixed point of (r^)^ . If r^{P) G P, we must have 

P = r\P) 

since {r^)^ has a unique fixed point in V. 

At this point it is worth pointing out that until now we have ignored an important 
constraint [SjIIO] for the risk-sensitive filter, namely that the matrix 

Vt = (Pr^ - 9D^D)-^ (4.18) 

should be positive definite for all t. If this condition is satisfied, then the fixed point 
P of will be in V, ensuring that it is the unique fixed point of r^. 

5. Positiveness conditions for Vt. In this section we identify conditions on the 
initial covariance Pq and risk-sensitivity parameter 9 which ensure that the trajectory 
of iteration Pt+i = r^{Pt) satisfies Vf > for all t. Our analysis will exploit the 
monotonicity of Riccati operator (P) with respect to the partial order of positive 
definite matrices. 

Lemma 5.1. Let Pi and P2 be two matrices in V such that Pi > P2 and Pi^ — 
9D'^D > 0. Then 

r'iPi) > r\P2) . (5.1) 

Proof. The monotonicity of is due to the fact that the inversion of positive 
definite matrices reverses their partial order. In addition, congruence transformations 
and translation by symmetric matrices preserve the partial order. Since the operator 
(P) in (14. 4p can be expressed in terms of two nested inversions of positive definite 
matrices, two matrix translations and a congruence transformation, it is monotone in 
P. □ 
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Next, observe that for any nxp observer gain matrix G, the risk-sensitive Riccati 
equation (|4.5I) can be rewritten as 

P,+i = {A~ GC){Pt-^ - eD'^D)-\A - GCf + GG'^ + BB'^ 
-[{A - GC)iPt-^ - eD'^D)-^G'^ - G]{Rl)-^ 

X [{A - GC){P^r^ ~ 0D^D)-^G^ - Gf . (5.2) 

This expression can be obtained by writing A = {A—GC)+GG in (j4.5p and performing 
simple algebraic manipulations. While it may appear surprising that a free matrix gain 
G can be introduced in the equation, the above modification has actually a simple 
explanation. Consider the state-space model p.ip - p.2p . We can always design a 
preliminary suboptimal observer 

if^,=Axf + G{yt-Gxf). (5.3) 

Then the residual if — Xt — xf admits the state-space model 

if^^ = {A- GG)if + But - Gvt 
yt - Cif = Cif + vt , (5.4) 

for which the only difference with respect to the original model p.ip - p.2p is that 
the process noise But — Gvt and measurement noise ut are now correlated. The risk- 
neutral and risk-sensitive problems associated to the original model p.ip - p.2p and 
modified model ()5.4p are exactly the same since observations yt and 

yf = yt- Gxf (5.5) 

can be obtained causally from each other. In particular, the variance matrices Pt 
of the error are the same for both models. Thus it should not be a surprise that 
the solution Pt of Riccati equation (|4.5p should also solve the risk-sensitive Riccati 
equation (|5.2p corresponding to modified model (|5.4p . 

One important advantage of introducing the free matrix gain G is that when the 
pair (C, A) is observable, the characteristic polynomial of the closed-loop observer 
matrix A — GG can be assigned arbitrarily [12]. In particular, it is possible to ensure 
that the matrix A — GG is stable, i.e. all its eigenvalues are strictly inside the unit 
circle. In this case, let 

r= max \XJA~GG)\ 

l<i<n 

denote its spectral radius. For p < 1/r, the matrix p{A — GG) will also be stable, and 
when {A, B) is reachable, the algebraic Lyapunov equation (ALE) 

Y.p = p^{A~ GG)Y.p{A - GCf + BB^ + GG'^ (5.6) 

admits a unique positive definite solution 

oo 

Sp ^J2p^\A - GCf{BB^ + GG^){{A - GCff . (5.7) 

k=0 

Note that is positive definite if and only if the pair A — GC, [ B G ] is reachable. 
But if this pair is not reachable, by the Popov-Belevich-Hautus (PBH) test [HI p. 
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366] , there must be a left eigenvector oi A — GC which is orthogonal to the column 
space of [ -B G ] , so 

z^{A - GC) = , z^B = z^G = . 

This implies z'^ A = Az^, so z'^ is a left eigenvector of A perpendicular to the column 
space of B, which implies that (A, B) is not reachable, a contradiction. 

If we select 1 < p < l/r, the matrix Sp is positive definite and the matrix 

AI = (1- p-^)Y.-^ -OD'^D (5.8) 
will be non-negative definite if and only if the matrix 

is non-negative definite. But because the matrices y}J^ DY}J^ and DYipD^ have 
the same nonzero eigenvalues, M is non-negative definite if and only if 

e^^D^pD'^ < /, (5.9) 

or equivalently 

where \i{DYipD^) is the largest eigenvalue of DYipD^ . It is strictly positive since Sp 
is positive definite and D has full row rank. 

Lemma 5.2. If the initial variance Pq for the risk-sensitive Riccati equation ^-4^ 
(or equivalently i5.2\) ) satisfies < Pq < Sp and < 9 < Pp, the entire trajectory 
of the recursion Pt+i = r^ (Pt) satisfies < Pt < Sp, so Vt > 0. Furthermore, for 
Pq = Yip, the sequence Pt is monotone decreasing. 

Proof. Suppose first that Pq — Sp. Then the non- negative definiteness of the 
matrix M in (j5.8p implies Vq > 0. By subtracting (15.21) for t = from ALE (|5.6p . we 
obtain 

Yp-Pi = (A~ GG){p'Yp - (S;i - eD'^D)-^){A - GCf 
+ [{A - GC)(S;i - OD^Dy^C^ - G] 

x(i?^)-i[(A - GC)(S;i - 9D^D)-'G^ - Gf . (5.11) 
But when M is non-negative definite, the matrix 

p'Yp-iY^'-eD'^D)-' 

appearing in the first term of the right hand side of (j5.1ip is non-negative definite, 
which implies 

Pi=r%Yp)<Po^Yp. (5.12) 
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By induction, suppose that < Pt-i- The motonicity of r imphes 

Pt+i </(Pt_i) =Pt 

so Pt is monotone decreasing. 

Next, consider the case of an initial condition Pq < Sp. The monotonicity of 
imphes 

Pi=r'>{Po)<r'{^p)<i:p 

where the last inequality uses (|5.12p . Proceeding by induction, we deduce that Pt < 
Sp for all t. This implies 

Pf-^ > > en'^D 

so Vf > for ah t. □ 

Remarks: 

1) For the risk- neutral case {6 = 0), the solution Sp of the ALE (|5.6p is similar 
to an upper bound proposed for the positive definite solution of the algebraic Riccati 
equation (ARE) in |^ (see also J15:), which was also shown to yield a monotone 
decreasing sequence of iterates. However the construction of the upper bound given 
in [B] is purely algebraic, whereas for p = 1 the covariance matrix Sp can be interpreted 
as the steady-state error variance of the suboptimal filter (15. 3p . 

2) Since the bound /3p for the risk-sensitivity parameter depends on both G and 
p, it is of interest to determine if a choice of G and p makes the bound as large as 
possible. Note in this respect that there exists a trade-off between making (3p as large 
as possible and enlarging the set < Pq < Sp of allowable initial conditions, since 
from (|5.9p in order to increase the range of 6 values, Ep must be as small as possible, 
which shrinks the domain of allowable PqS. A clue on how to select G is provided by 
the scalar case analysis presented in [23l Chap. 9]. With n = m = p = q = 1, ii we 
select the gain G = A/C, A — GC = so p can be selected arbitrarily large, and 

for all p. Letting p co in (|5.10p . the bound /3p then coincides with the scalar case 
bound derived on p. 116 of [53]. This suggests that selecting a gain G that moves 
all the eigenvalues of the closed-loop observer A — GC to zero is likely to yield a 
satisfactory upper bound /3p. Note that in the multivariable case, A — GC cannot in 
general be set to zero by selecting the gain matrix G, but the characteristic polynomial 
and some additional parameters (when p > 1) can be assigned arbitrarily fT2'> Chap. 
7]. Unfortunately, as will be demonstrated on an example in the next section, the 
gain G which assigns all the eigenvalues oi A — GC to zero does not necessarily yield 
the largest possible value of /3p and a comprehensive search over G and p is usually 
required to make /3p as large as possible. 

By assembling the preliminary results of the current and previous sections, we 
obtain the following convergence theorem for risk-sensitive filters. 

Theorem 5.3. Assume that in system \3.1]} - l[3^} . the pairs {A, B) and {C,A) 
are reachable and observable. Then if < < tn and < /3p with N > n, the 
risk-sensitive Riccati map r^ has a unique positive definite fixed point P such that 
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Furthermore, if the initial condition Pq of the Riccati equation 
Ep, the entire trajectory of iteration Pt+i = r^{Pt) stays in P, 



oo 



p-i - en'^D > 0. 

satisfies < Pq < 

satisfies Vt > and tends to P. In this case the limit K of filtering gain Kt as t 
has the property that A — KC is stable. 

Proof. Since the trajectory Pt stays in V and satisfies Vt > 0, and the TV-fold 
operator = (r^)^ has a unique fixed point P in P, the sequence Pt must tend to 
P, and P must be such that P^^ - OD'^D > 0. Then the stabihty of ^ - KC can be 
estabhshed by applying Lyapunov stability theory to the risk-sensitive ARE. 



P^{A- KC){P-^ - 0D'^D)-'^{A - KCf + BB^ 



□ 

This theorem answers in the affirmative the question posed in [Sj whether it is pos- 
sible to specify a-priori a range of risk-sensitivity parameters 9 and initial conditions 
such that the risk-sensitive Riccati equation admits a solution. On the other hand, 
it leaves open the computation of the maximum value of 9 (its breakdown value in 
the terminology of |23] ) for which a solution exists, which corresponds to the optimal 
fiher. 

6. Example. To illustrate our results, we consider a system with 



A = 



0.1 





1 

1.2 



C^[l -1] 



and B = D = 12- Note that A is unstable, {A, B) is reachable, but the pair (C, A) is 
barely observable, since the eigenvector 



P = 



1 

1.1 



corresponding to the eigenvalue A = 1.2 is at a 92.72 degree angle with respect to C. 
In this case, for iV = 2, the largest eigenvalue of matrix £2(^4 + ^^2^-2)^^ C2 equals 
1, so 92 = 2. To evaluate the value T2 for which ^2 becomes singular, the smallest 
eigenvalue of Gramian is plotted in Fig. 16. li as a function of 6* for < < 2 x 10""^. 
For this example, it decreases linearly, and becomes negative at T2 — 0.715 x 10"'^. 
For completeness, the smallest eigenvalue of reachability Gramian W2 is plotted in 
Fig. 16.21 over the same range of 9. It is monotone increasing, as expected, but the rate 
of increase is very small, since A2(VF|') varies from 1.002828 to 1.02831. Note that 
although we have selected N — 2 here, larger values of N can be considered, and in 
fact as N increases, r^v increases and 9n decreases, and for this example both values 
tend to 1.33 x lO'^ for large N. 

Next, to evaluate /3p, we observe that with the gain matrix 



G 



-13.1 
-14.4 



3.1) 



the closed-lood matrix 



A-GC 



13.2 
14.4 



-12.1 

-13.2 
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Fig. 6.1. Smallest eigenvalue of observability Gramian fi* for 0<9<2xl0 . 

is nilpotent, i.e., its eigenvalues are zero. Note however that G is rather large, which 
reflects the weak observability of the system. In this case, if we select p = 2, the 
solution T,2 of the Lyapunov equation 115. 6|) is 

1.4622 1.5954 " 
1.5954 1.7431 ' 

Its largest eigenvalue is Ai(S2) = 3.2042 x 10'^ and from (|5.10p . we obtain P2 = 
2.3407 X lO-'^. This bound is significantly smaller than T2. To illustrate Lemma [5.21 
the risk-sensitive Riccati iteration Pt+i — r^{Pt) is simulated with 6-/32 and initial 
condition Pq = S2. The two eigenvalues of Pt and Vt are plotted as a function of t 
for < i < 10 in Fig. 16.31 and Fig. 16. 4[ respectively. As expected, the eigenvalues 
remain positive and are monotone decreasing. The monotone decreasing property of 
the eigenvalues is due to the fact that if two nxn positive definite matrices P and Q 
are such that P > Q and if the eigenvalues of P and Q are sorted in decreasing order, 
then Ai(P) > Xi{Q) for 1 < i < n. In other words, the eigenvalues follow the partial 
order of positive definite matrices. Since according to Lemma 15.21 the sequence Pt 
is monotone decreasing, so are its eigenvalues. The figures indicate that the risk- 
sensitive Riccati equation converges very quickly, after 4 or 5 iterations. Note that 
if P denotes the limit of Pt, its smallest eigenvalue is 1.003, but the other eigenvalue 
is much larger and equals 332.4. This reflects our earlier observation that one of the 
modes of the system is barely observable. The eigenvalues of the matrix A — KC for 
the estimation error dynamics are 0.034 and 0.776, so the filter is stable, as expected. 

Finally, to illustrate the onset of breakdown as 9 increases, the two eigenvalues 
of the fixed point solution P^ of and of the corresponding matrix — {{P^)^^ — 
912)^^ are plotted as a function of 9 in Fig. 16.51 and Fig. 16.61 respectively, for < 
9 < 0.95 X 10~^. It is known [lOJ p. 379] that P^ is a monotone increasing function 
of 9, and as expected the eigenvalues of P^ are monotone increasing. However, while 




CONVERGENCE OF RISK-SENSITIVE FILTERS 



17 



1.0028 




Fig. 6.2. Smallest eigenvalue of reachability Gramian W2 for 0<6<2xl0 ^ . 

the change in the smaller eigenvalue is barely noticeable, the eigenvalue representing 
the weakly observable mode increases rapidly with 6. As 9 increases, the eigenvalues 
of start diverging, and the breakdown value of 9 for this example is just above 
0.95 X 10"'^. This value is significantly higher than the bound P2 obtained by applying 
Lemma [5.21 with the gain (jS.ip . suggesting that the bound can be improved. In fact, 
an exhaustive search over G and p showed that (3p is maximized by selecting 

_ r -7.2196 " 
~ [ -7.9753 

and p = 1.2849, in which case f3p = 0.4824 x 10"^. 

7. Conclusions. A convergence analysis of risk-sensitive filters has been pre- 
sented. It relies on extending Bougerol's contraction analysis of risk-neutral Riccati 
equations to the risk-sensitive case. This was accomplished by considering a block- 
filtering implementation of the iV-fold Riccati map and showning that this map is 
strictly contractive as long as an observability Wronskian depending on the risk- 
sensitivity parameter remains positive definite. A second condition was derived for 
the risk-sensitivity parameter and initial error variance to ensure that the trajectory 
of the risk-sensitive Riccati iteration stays positive definite at all times. The two 
conditions obtained can be viewed as multivariable versions of conditions obtained 
earlier by Whittle [23l Chap. 9] for the scalar case. 

Although the results we have presented concern filters with a constant risk- 
sensitivity parameter 9, a closely related class of robust filters was derived recently \W\ 
by assigning a fixed relative entropy tolerance to increments of the state-space model. 



18 



B. C. LEVY AND M. ZORZI 



3500 




1 2 3 4 5 6 7 8 9 10 

t 



1.1 



1.08- 
1.06- 

® 

1.04- 
1.02- 



1 2 3 4 5 6 7 8 9 10 

t 



Fig. 6.3. Eigenvalues of Riccati solution Pt for < f < 11 with 9 = (52 cii^d initial condition 
PO = S2. 

In this case, the risk-sensitivity parameter is time-varying, but the tolerance is fixed, 
and based on computer simulations, it appears that the risk-sensitivity parameter 
and associated filter always converge as long as the relative entropy tolerance remains 
small. Since Bougerol's analysis [5] is applicable to systems with random fluctuations, 
it is reasonable to wonder if the analysis presented here can be extended to establish 
the convergence of the filters discussed in [19] . 
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